Combining Synthetic and Observed Data to Enhance Machine Learning Model Performance for Streamflow Prediction

نویسندگان

چکیده

Machine learning (ML) models have been shown to be valuable tools employed for streamflow prediction, reporting considerable accuracy and demonstrating their potential part of early warning systems mitigate flood impacts. However, one the main drawbacks these is low precision high values extrapolation, which are precisely ones related floods. Moreover, great majority evaluated considering all data equally relevant, regardless imbalanced nature records, where proportion small but most important. Consequently, this study tackles issues by adding synthetic observed training set a regression-enhanced random forest model increase number introduce extrapolated cases. The generated with physically based Iber precipitations different return periods. To contrast results, compared only fed data. performance evaluation primarily focused on using scalar errors, graphically errors event, taking into account precision, over- underestimation, cost-sensitivity analysis. results show improvement in trained combination respect observed-data regarding values, root mean squared error percentage bias decrease 23.1% 38.7%, respectively, larger than three years period. utility increases 10.5%. suggest that addition precipitation events existing records might lead further improvements models.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving the Performance of Machine Learning Algorithms for Heart Disease Diagnosis by Optimizing Data and Features

Heart is one of the most important members of the body, and heart disease is the major cause of death in the world and Iran. This is why the early/on time diagnosis is one of the significant basics for preventing and reducing deaths of this disease. So far, many studies have been done on heart disease with the aim of prediction, diagnosis, and treatment. However, most of them have been mostly f...

متن کامل

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

Combining Data Mining and Machine Learning for Effective Fraud Detection

This paper describes the automatic design of methods for detecting fraudulent behavior. Much of the design is accomplished using a series of machine learning methods. In particular, we combine data mining and constructive induction with more standard machine learning techniques to design methods for detecting fraudulent usage of cellular telephones based on profiling customer behavior. Specific...

متن کامل

Combining Data Mining and Machine Learning for Effective User Profiling

This paper describes the automatic design of methods for detecting fraudulent behavior. Much of the de&,, ic nrrnm,-,li~h~rl ,,&,a n .am.L~ nf mn.-h;na lm..~:~~ e-. .. ..--..*.*yYYA’“.. UY.“b Y UISLUY “I III-Yllr IxuIY11~ methods. In particular, we combine data mining and constructive induction with more standard machine learning techniques to design methods for detecting fraudulent usage of ce...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Water

سال: 2023

ISSN: ['2073-4441']

DOI: https://doi.org/10.3390/w15112020